A POINT PROCESS DESCRIBING THE COMPONENT SIZES IN 
THE CRITICAL WINDOW OF THE RANDOM GRAPH 

EVOLUTION 

SVANTE JANSON AND JOEL SPENCER 

Abstract. We study a point process describing the asymptotic behavior of sizes 
of the largest components of the random graph G(n,p) in the critical window 
p — rT x + An~ 4,/3 . In particular, we show that this point process has a surprising 
rigidity. Fluctuations in the large values will be balanced by opposite fluctuations 
in the small values such that the sum of the values larger than a small e is almost 
constant. 



1. Introduction 

We consider the asymptotic behavior of the component sizes in the random graph 
G(n,p), where throughout this paper p = n _1 + An~ 4 / 3 for some fixed A with 
— co < A < co. It is well-known that this is the critical window of p where the "phase 
transition" occurs. It is further well-known that, for the p we consider, the largest 
components are of order n 2//3 . We therefore scale by this factor; if the components 
are Ci,C2, . . . ,C r , in order of decreasing size, say, and \Ci\ is the size (order) of Ci, 
we define £ n j to be n~ 2 / 3 |Cj| and consider the random set H n := {£ n j}[ =1 as a point 
process on (0, co) or (0,co]. See Appendix lAl for some technical background and 
note that it is convenient to define the point process formally as a random measure 
with point masses at the points £ n j; we will sometimes use this formalism, writing 
for example H n [a,6] for the number of points in [a,b], but we will also speak (and 
think) of point processes as random sets. 

It follows immediately from Aldous 1, Corollary 2], see Lemma IA.2I that as 
n — > co, the point processes H n converge in distribution to some point process = 
{^ A) } on (0,co] (in the vague topology on (0, co], see Appendix^); this also follows 
from a minor extension of results in Luczak, Pittel and Wierman |18j . see Janson, 
Luczak and Ruciiiski |121 Theorem 5.20]. Aldous further gave a description of the 
limiting process as the set of lengths of excursions of a certain reflected Brownian 
motion with parabolic drift, defined as B x (s) := W x (s) — mino< u < s W x (u), s > 0, 
where W x (s) = W(s) + As — s 2 /2 for a standard Brownian motion W. 

We will usually keep A fixed, and will then often omit it from the notation, thus 



writing H = and & = . Conversely, we may write when necessary. Note 
that we may regard (S( a ))a as a stochastic process indexed by A £ (-co, co); this is 
the standard multiplicative coalescent as constructed by Aldous P , except that the 
variables 3^ are represented as point processes while Aldous uses the equivalent 
representation as sequences (£i)f°; cf. Lemma I A. 21 although Aldous uses a stronger 
topology. 
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The aim of this paper is to study the limiting point process S. The number of 
components in G(n,p) tends to infinity (in probability) as n — > oo, so we expect an 
infinite number of points £j in 3. Moreover, if we say that the weight of a point x 
is x, the total weight of 3 n is Yli £ni = n 1 ^ , so we expect the total weight of S, i.e. 
Yl & = I x *° De infinite a.s.; indeed, this is a simple consequence of Theorem ll.il 
(Still, we caution that results on the limiting process 3 do not automatically follow 
from results on the discrete G(n,p). Had we, for example, chosen the "wrong" 
parameterization = n~°- 7 |Cj| then 3 would be almost surely empty.) 

Our main result is the following. We also give in later sections various other 
results; several of them have been more or less well-known for a long time, but 
perhaps not published previously in this form. 

Theorem 1.1. Let — oo < A < oo, and let 3 be the limiting point process defined 
above. Let Z £ := > £ £« = / e °° xdB{x) be the total weight of all points in 3 that 
are at least e. Then, as e — > 0, 

EZ £ = (-) V V 1 / 2 + A + (27T)- 1 / 2 AV/ 2 + O(e) (1.1) 

and 

Var Z £ = (2/^) 1 / 2 e 1 / 2 + 0(e). (1.2) 
Ln particular, E Z e — > oo and Var Z e —* as e — » 0. 

We will also give an exact, but more complicated, formulas for EZ e in Corol- 
lary 14.21 and Var Z e in Corollary 18.31 It seems non-trivial to obtain the asymptotics 
above from these formulas. 

Thus, as e — * 0, the variables Z e tend to infinity, but they become more and more 
concentrated about their mean; hence, the random fluctuations disappear in the 
limit. In other words, the process 3 is very rigid, and any random fluctuation in the 
weights of the largest points has to be exactly balanced by opposite fluctuations in 
the weights of smaller points; this will be seen again in Section El where we consider 
the Palm distributions. Note that, because of the scaling, this is a non-trivial result 
in contrast to the corresponding fact that 3 n has a constant total weight n 1 / 3 . Note 
also that this is very far from the behaviour of a Poisson process. 

We will prove Theorem 11.11 bv two different methods, both classical, each giving 
a partial result only, in Sections El and H3 

In contrast to Theorem II. 1\ the number of points > e in 3, i.e. S[e, oo), is not 
sharply concentrated. 

Theorem 1.2. Let W e := H[e, oo) be the number of points in 5 that are at least e. 
Then, as e — > 0, 

E We = (A) V2 e -3/2 _ (27r) -l/2 A 2 £ -l/2 + 1 ln(1/£) + 0(1) (L3) 

and 

VarW e = (-?-) 1/2 £- 3/2 + 0(e~ 1 ) ~EW £ . (1.4) 
\9it / 

Remark 1.3. It seems likely that W £ is almost Poisson distributed, in the sense that 
its total variation distance to a Poisson distribution with the same mean tends to 
as e — > 0, but we leave this as an open problem. If this holds, it would immediately 
imply asymptotic normality of W e . 
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The main interest in Theorem 11.11 comes from the fact that Z £ approximatively 
describes the large component sizes in G(n,p) for large n. We formalize this in 
the following intuitively obvious result; see Section [5] for a formal verification of the 
technicalities. 

Proposition 1.4. Let Z n£ = X^ n .> e £m = x dz* n {x) be the total weight of all 
points in E n that are at least e; thus Z n£ equals n~ 2 l 3 times the total size of all 

components >en 2 / 3 in G(n,p). For every fixed e > 0, as n — > oo, Z n£ A Z £ with 
convergence of all moments, i.e., for every q > 0, EZ' E — ► KZ £ . 
The same holds for Z' n£ := Y^£ ni >e Cm- 

Similarly, if W n£ := #{i : £ n j > e} = H n [e, oo), the number of components 
>en 2 / 3 in G(n,p), then W n£ —* W £ , with convergence of all moments. 

We introduce some more notation. Let X n (k) denote the number of components 
with k vertices in the random graph G(n,p), and let Y n (k) := kX n (k), the total 
number of vertices in these components. We further define X n (L) := J2kei X n (k) 
and Y n {I) := ^2i~ £ jY n (k) f° r an interval /. (For simplicity, we omit p from the 
notation.) Thus Z n£ = n~ 2 ^Y n [en 2 / 3 , oo), Z' ne = n~ 2 / 3 Y n (en 2 / 3 , oo), and W n£ = 
X n [en 2 / 3 , oo). We denote falling factorials by n— := n ■ ■ ■ (n — k + 1). 

Remark 1.5. Although we keep A fixed for simplicity, it is easy to see (e.g. using 
the monotonicity of G(n,p) in p) that the Proposition 11.41 holds also for a sequence 
A n — * A. Moreover, as a consequence of this and monotonicity, the same holds for 
the random graph G(n, m) with a deterministic number m = n/2 + (A + o(l))n 2 ' 3 /2 
edges. 

Acknowledgement. We thank David Aldous for interesting discussions. 

2. First partial proof of Theorem 11.11 

If n is a probability distribution on the non-negative integers, let T(fi) denote the 
(random) total progeny of a Galton-Watson process with offspring distribution /i, 
starting with one initial particle. (Thus T(/i) €{l,2,...,oo}.) 

Lemma 2.1. Let A G (— oo, oo) and e > be fixed. 

(i) The limit U £ (X) := lim^oo n 1/3 P(T(Po(l + An" 1 / 3 )) > en 2 / 3 ) exists, and 

{2i:)- l / 2 x- 3 l 2 e-~ x2x/2 dx. (2.1) 

(ii) More generally, if \ n — > X, then 

n l/3p( T ( Po ( 1 + Ann -l/3)) > £n 2/3) ^ 

(iii) Moreover, for any fixed 5 S R and any sequence 5 n — > 5, 

n i/3 p ( T (Bi(Ln-5 n n 2 / 3 J,n" 1 + An" 4 / 3 )) > en 2 / 3 ) -> U £ {\-5). 

Proof, (i): First consider the possibility of an infinite total progeny. By elementary 
branching process theory, if q := P(T(Po(7) = oo)), then q = for 7 < 1, while 
q > for 7 > 1, and then 1 - q = e" 97 , or 7 = - ln(l -q)/q = l + q/2 + 0(q 2 ). It 
follows that if 7 — > 1 with 7 > 1, then g — ► and q ~ 2(7 — 1). Consequently, as 
n — ► 00, for any real A, 

n 1/3 P(T(Po(l + An" 1/3 )) =00) ^2max(A,0). (2.2) 
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Next, consider a finite total progeny > en 2 / 3 . By Otter JH]) see also Pitman |21j . 

P(T(/x) = k) = -r F(Sk = k — 1), 1 < k < oo, (2.3) 

where is the sum of independent random variables with the distribution ji. In 
particular, for a Poisson distribution, using Stirling's formula, 

P(T(Po(l + An" 1 / 3 )) =h) = - P(Po(fc(l + An" 1 / 3 )) = k - l) 

fe^Hl + An- 1 / 3 )^ 1 fcfl+An - 1/3) 
fc! 

= (2vr)- 1 /2 A; - 3 /2(l + ^-l/Sjfcg-fcAn-Va ^ + 0( | A | n -l/3 + . (2 . 4) 

We have 

ln(l + An" 1 / 3 ) = An" 1 / 3 - ^(An" 1 / 3 ) 2 + 0{\ An" 1 / 3 ] 3 ). 
Hence, for en 2 / 3 < k < a n := (n 1 / 3 ) A)" 1 ) 5 / 2 (with a n = oo when A = 0), (|2.4|) yields 

P(T(Po(l + An^ 1 / 3 )) = k) = (2^)- 1 / 2 fc- 3 / 2 e-5 A2fc ™- 2/3 (l + O^ 1 / 6 )) . (2.5) 
Moreover, assuming that n is so large that |An _1 / 3 | < 1/2, 

ln(l + An" 1 / 3 ) < An- 1 / 3 - ^(An" 1 / 3 ) 2 + flAi^ 1 / 3 ! 3 < An" 1 / 3 - iA 2 n~ 2 / 3 
and thus, by (J23J), 

P(T(Po(l + An" 1 / 3 )) = k) = 0(k- 3 / 2 e-i x2kn ~ 2/3 y (2.6) 

Summing over k £ [en 2 / 3 ,a n ) we find, by (|2.5|) . (|2.6|) and dominated convergence, 
n 1 / 3 P(T(Po(l + An- 1/3 )) G [en 2 / 3 ,a n )) 

■ \a 



n 1 / 3 



/ P(T(Po(l + An- 1/3 )) = [x\) dx 

J Ten 2 / 3 ! 

nP(T(Po(l + An" 1/3 )) = [n 2/3 x|) dx 

n -2/3|- £n 2/3-| 

(2vr)- 1 / 2 x- 3 / 2 e- A2:E / 2 C ix. (2.7) 



Furthermore, the sum over A; > a n is exponentially small by (|2.6j) . Hence, the result 
follows by and (t2~7|) . 

(ii): By the same proof as (i), or by (i) and monotonicity. 

(hi): One could use Q2.3|) and argue as above, but we will instead use a Poisson 
approximation. For any N and p, we have the bound on the total variation distance 

d TV (Bi(N,p),Po(Np)) <P, 

see e.g. [21 Theorem 2.M]. Hence, using a maximal coupling of Bi(iV,p) and Po(Np) 
in each family, we can couple the Galton-Watson processes with offspring distri- 
butions Bi(iV,p) and Po(iVp) such that the probability that they differ before they 
have reached at least n 1 / 3 individuals is at most n 1 / 3 ^; furthermore, conditioned on 
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both reaching ra 1 / 3 together, and being equal so far, the probability that they differ 
before they have reached at least en 2 / 3 individuals is at most en 2 ^ 3 p. Hence, 

\F(T(Bi(N,p) > en 2/3 )) - P(T(Po(iV» > en 2/3 ))| 

< n V 3 p + P(T(Po(iVp) > n^en^p. (2.8) 

Now, let N = [n — <5n 2 / 3 J and p = n~ l + An~ 4 / 3 and let n — > oo. Then Np — > 1, 
and thus, for each fixed M and n > M 3 , 

F(T(Po(Np) > n 1/3 ) < F(T(Po(Np) > M) -> P(T(Po(l) > Af). 

Since T(Po(l)) is finite a.s., the latter probability tends to as M — > oo, and it 
follows that P(T(Po(iVp) > n 1/3 ) — > 0. Consequently, the right hand side of ((2~l8"|) is 
0(n -2 / 3 ) + o(l) • 0(n _1//3 ) = o(n~ 1 / 3 ). The result follows from (|2.8|) and (ii), since 

JVp = (n - 5n 2 / 3 + 0(l))(n _1 + An" 4 / 3 ) = 1 + Ann" 1 / 3 , 

with A n = A - 6 + Oin- 1 / 3 ) — > A — <5. □ 

We give alternative formulas for U £ (X) defined in Lemma 12.11 

Lemma 2.2. Let — oo < A < oo and e > 0. Then 



U £ (X) = (l) 1/2 e- 1/2 + A + £(2vr)- 1 /2 x - 3 /2(i _ e -A 2 */2) dx 



(2.9) 



-) 1/2 e- 1 / 2 + A + (2vr)- 1 / 2 A 2 e 1 /2 + 0(A 4 e 3 / 2 ). (2.10) 



.71"/ 

Proof. First note that in the case A = 0, (|2.1|) yields 

/oo 
(2vr)- 1 / 2 x- 3 / 2 dx = (2/vr) 1 / 2 e- 1 /2. (2 . n) 

Since 2max(A,0) = A + |A|, (|2~Tj) further yields 

/oo 
(2vr)- 1 / 2 x- 3 / 2 (1 - e~ x2x / 2 ) dx. (2.12) 

Now, for A 7^ 0, by change of variables and a standard integration by parts, 
/ x -3/2 (1 _ e -AV2) dx = (A 2 /2) l/2 / y -3/2 ^ _ e - W ) dy 

Jo Jo 

= (A 2 /2) 1 / 2 2T(l/2) = (2vr) 1 / 2 |A|, 

and thus (|2~T2l yields 

U £ (X) = U e (0) + A + r^iry^x-^ 2 (1 - e"^ 2 ) dx. (2.13) 

This proves (|2~3J) . and (EHUj) follows by the expansion l- e - x ' 2x/2 = \ 2 x/2+0(\ 4 x 2 ). 

□ 

Remark 2.3. Expression (|2.1[) might lead the casual reader to suppose that A = 
was somehow special. The equivalent expression (|2.9|l . however, shows that ?7 e (A) is 
a smooth function of A. This corresponds to the generally held belief that there can 
be no further refinements of the critical window, that no value of A is more special 
than any other, and that natural functions vary smoothly with A. 
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Returning to the random graphs, note that given the graph G(n,p), the prob- 
ability that a random vertex belongs to a component of size at least en 2//3 is 
n _1 y[en 2/ ' 3 , oo) = n~ 1 / 3 Z n£ . Taking expectations we see that EZ„ £ equals n 1 / 3 
times the probability that a given vertex v belongs to a component of size at least 
£n 2/3 j n Q(n^py We explore the component containg the given vertex by the stan- 
dard breadth-first search. In this search, we explore first the neighbours of v, then 
their neighbours, and so on, see e.g. |2U or ^3 Section 5.2]. When we explore the 
neighbours of a vertex, we find Bi(n — m,p) new vertices in the component, where 
m is the number of vertices found so far. Thus, the process is dominated by a 
Galton- Watson process with offspring distribution Bi(n,p), and, if we stop when we 
reach en 2 / 3 vertices, dominates a Galton- Watson process with offspring distribution 
Bi(|_n — en 2 / 3 J ,p); hence, the probability that we find at least en 2 / 3 vertices in the 
component lies between the probabilities that these Galton-Watson processes have 
a total progeny of at least en 2 / 3 . Consequently, 

P(T(Bi( [n - en 2 / 3 J , n' 1 + An" 4 / 3 )) > en 2 / 3 ) 

< n~ 1/3 E Z n£ < P(T(Bi(n, n' 1 + An" 4/3 )) > en 2/3 ) . 
By Lemma 12. II and Proposition II. 4| this yields 

U e (\-e) <EZ E < U £ (X), 

and (jl.lj) follows by Lemma 12.21 

It seems more difficult to estimate Var Z ne by this method, and we will use another 
approach in Section |3 

3. Complexity 

The complexity c(G) of a graph G with v vertices and e edges is defined by 
c{G) := e — v + 1. Thus the complexity is for trees, 1 for unicyclic connected 
graphs, and > 2 otherwise. We say that a connected graph with complexity > 2 is 
complex. 

We can refine the point processes E n and E by considering the complexities of the 
components. We can think of this as giving each point in the processes a label; a 
point in H n is labelled by the complexity of the corresponding component. Formally, 
we can think of the labelled versions, H* and 5*, say, as point processes on the space 
(0, oo] x N, or better (0, oo] x N*, where N = {0, 1, . . . } and N* is the compact space 
N U {oo}. The results by Aldous PJ Corollary 2] and Luczak, Pittel and Wierman 

|18j referred to above actually consider the complexity too, and show that H* -i H* 
as n — > oo, for a suitable labelling H* of H. Aldous p] Corollary 2] describes S* by 
the process B x defined above: introduce a process of marks on (0, oo) that, given B x , 
is a Poisson process with intensity B^(s)ds; then, as said above, the points £j are 
the lengths of the excursions of B x , and each excursion is labelled with the number 
of marks inside it. In other words, given B x , each point gets a label that has 
a Poisson distribution whose mean is the area under the corresponding excursion, 
and different points are labelled independently. We will give another description in 
Theorem 13.11 below. 

We let, for t > 0, be the subset : c(C{) = £} of H n of points with labels £, 
i.e. the set of scaled sizes of components of G(n,p) with complexity t. Similarly, let 

E e be the subset of S of points with labels £. Since 5* S*, we have for 
every i. 
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Let C(k,£) be the number of connected graphs with complexity £ on k (labelled) 
vertices (they thus have k + 1 — 1 edges). Thus C(k, 0) is the number of trees, and 
by Cayley's theorem, C(k, 0) = k k ~ 2 . More generally, Wright (2^] proved that for 
every fixed I 

C{k,£) ~ w e k k+3e / 2 ~ 2 as k -> oo, (3.1) 
for some constants u;^, for which Wright 25 gave a recursion formula. (See also [111 
§8] and the references there.) We have wo = 1 and w\ = y ir/8. It was shown in 
[211 that 

EL' / % 

^ = — , * > (3.2) 

where L is the area under a normalized Brownian excursion. If we introduce the 
moment generating function of L, we thus have 

oo 

^(t) = Ee tL = Y,w e t e . (3.3) 

e=o 

The moments E Z/ and the moment generating function \& had earlier been studied 
by Louchard Q11[T7|. Note that V(t) is finite for all t > (and thus (f33|) holds for all 
complex t); indeed, as remarked in [SJ Remark 3.1] (where £ = 2L), it follows from 
the well-known asymptotics for wg, see e.g. ^3 §8] and ^3 Theorem 3.3 and (3.8)], 
that EL e ~ v / 18^(12e)- f / 2 £ £ / 2 as £ -» cx>, and thus Lemma 4.1(h)] implies, cf. 
Remark 4.9] , 

(t) ~ it 2 e' 2/24 as t -» +oo. (3.4) 

We now can state the result describing 3*. For a? > 0, let P x be the distribution 
on N given by 

Theorem 3.1. The point process 3* on (0,oo] x N* can be obtained from 3 6?/ 
independently giving each point £j 6 3 a random label with the distribution P^ . 

Proof. Conditioned on the vertex sets of the components of G(n,p), the internal 
structures of the components are independent. Moreover, a component of order k is 
distributed as G(k,p) conditioned on being connected. Let Pk, p (£) be the probability 
that such a component has complexity £. The probability that G(k,p) is connected 

and has complexity £ is C(k, £)p k+e -~ l {\ — p)(2)~ k ~ e+1 and thus 

P k>p (£) = C(k, £) {^-\ J £ C(k, £) (-£-) . (3.6) 

Consequently, the labelled process 3* can be obtained from 3 n by giving the points 
£ n i labels independently, such that the label of a point x has the distribution P xn 2/3 p 
given by (|3.6[) . 

By Bollobas 4., Theorem V.20], there exists a constant c > such that, for all k 
and £, 

C(k,£)<(c/ef/ 2 k k+3i / 2 - 2 . (3.7) 
Hence, if k < on 2 / 3 for some fixed 6, and n is so large that p/(l — p) < 2/n, 
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Consider a sequence k = k(n) such that kn 2 / 3 — * x for some x > 0. By (j3,l[) . 
for every £ > 0, as n — > oo, 

§H(t^)'-^ 3 ' /2 "-'( 1+ °«)-^ /2 < 3 - 9 ' 

Together with (|3.8|) . this implies by dominated convergence 

E^(t^)'-E- 3,/2 = *(* 3/2 ). <»o) 



C(fc,0) VI -p 

Consequently, from (j3.fi |) . (|3.9j) . (j3.1()|) and (|3.5j) . for n — > oo and every fixed i, 

Thus, the distribution Pfc iP converges to P x . 

Let 3' be the labelled point process constructed in the statement of the theorem. 
It follows from Lemma lA,2l and the Skorohod coupling theorem, see e.g. |141 Theorem 
4.30], that we may assume ^ n and s to be coupled such that — > ^ a.s. for every i. 
By the description of 3* above and the convergence of Pk tP to P x when kn~ 2 l 3 — > x, 

it follows that we may couple also the labels such that 3* ^ 3'. Hence 3* -i 3', 

and thus 3' = 3*. □ 

4. Intensity 



Let, changing the notation slightly from JH]) X n {k;£) denote the number of com- 
ponents with k vertices and complexity £ in the random graph G(n,p), and let 
Y n (k;£) := kX n (k;£), the number of vertices in these components. We further 
define, for an interval /, X n (I;£) := 2~2kel X n (k; I) , X n (k;>£) := Y^=i X n{k]j) 
and X n (I\>l) := Y^jLi X n (I; j), and similary for Y. Thus, for example, X n (k) = 
Z j Xn(k;j)=X n (k;>0). 

Consider now a fixed £ > and k < Cn 2 / 3 for an arbitrary constant C . Then, by 
well-known calculations, uniformly for all such k, 

EX n (k,t) = (Y)C(M)/ +M (1 -,,)<— *>*+G)-*-*« 



k , 1.2 



k 2 k 3 /jfe 



x (1 - p ^nk-k 2 /2-3k/2-£+l 

nl _,C(M) exp C_^ _ *L + (j. + ^ - l)An-V3 - ifcA 2 n- 2 / 3 
k! V In bn z 2 

-k- k\n- 1 ' 3 + — + -k 2 \n-^ 3 + 0(-)+ 0{n- 2 ' 3 - 
2n 2 \nJ 



1 _ e C(k,£) ( k 3 

n 



ex J_ k _ *!L _ I^n" 2 / 3 + ifc 2 An- 4 / 3 + (* - l^ 1 / 3 



n 



fc! K V 6n 2 2 ' 2 

+ 0(^)+0(n- 2 / 3 )) 

l_^C(M) e _ fc exp (_ F(A;n -2/3 ) A)) (i + {£ _ i) An -V3 + O(^) + 0(n- 2 / 3 )) 

(4.1) 
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where 



Note that 



F(x, A) := -U 3 - i* 2 A + \x\ 2 = {x X) * + A ° . (4.2) 
b 2 2 b 



w „ a; 3 (x-2A 2 x 3 . . 

Fix.X ) = — + x- ; -> — >0 (4.3 

v ' ; 24 8 ~ 24 ~ v ' 

for all x > and — oo < A < oo. 

In our first application of (|4.1|) . assume < a < 6 < oo and consider only 
k G [an 2 / 3 , 6n 2//3 ]. For such A; and fixed (|4,1|) gives, by (|3.1|) and Stirling's formula, 

EX n (k,£) ~ n 1 -^(2vr)- 1 / 2 ^/ 2 - 5 / 2 e- F ( fc ^ 2/3 ' A ) 
= (2vr)- 1 /2 u; ^^!y- 1 e -^/3,A),- 1) 

and summing over /c we obtain, as n — > oo, 

K(~i[a, b])=E £ X n(k, Q - (2n)-^w e J (a^V^ ^. (4.4) 

k=an 2 / 3 

Since — > we have, by Lemma [A.. 1| H£[a, 6] — > H^[a, 6] whenever a and 6 are 
continuity points of S . In this case, by Fatou's lemma, ES'[a, b] is at most the 
right hand side of (|4.4|) . 

For any a G (0, oo), a ± e are continuity points of S for all but at most countably 
many e € (0, a), and for such e we thus obtain a formula for E(H [a — e,a + e]) 
and thus an upper bound of E(H^{a}). Letting e — ► through such e, we see that 

E(5^{a}) = 0, so every point is a continuity point. Consequently, H^[a, b] S^[a, 6] 
whenever < a < 6 < oo. Summing over all £, we see that every point is a continuity 

point of H too, and thus H n [a, b] -i S[a, 6] whenever < a < b < oo. 

To prove convergence of the expectations, we verify uniform integrability by con- 
sidering second moments. (See also the more general Lemma 15 . 1 1 b elow : we will give 
a more elementary argument here, which in any case will be needed later.) 

For simplicity, fix £ and write E k '■= EX n (k,£). Further, let E k j denote the 
expected number of ordered pairs of distinct components of complexity £, of orders 
k and j, respectively, in G(n,p). Thus, if k ^ j then E k j = E(X n (k; £)X n (j; £)) , 
while 

E k>k = E(X n (k;£)(X n (k;£) - 1)) = E(X n (k;£)) 2 - E k . 

Consequently, 

, bn 2 / 3 ,2 bn 2 / 3 bn 2 / 3 bn 2 / 3 

E{X n {[an 2 l\bn 2 '%£)) 2 = e( £ X n (k;£)J = E kJ + E E * 

k=an 2 / 3 k=an 2 / 3 j=an 2 / 3 k=an 2 / 3 

(4.5) 
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We have, cf. (|4.1|) . by simple calculations, assuming, say, k + j < n/2, 

E k>j =( k ™ ) + k j ) C(k, £)C(j, ()p k+l - 1+ ^-\l - pr (k +] )-(k+j) 2 /2-3(k +j )/2-2i + 2 



j- 1 



= EkEj exp(J2 - ——:) ~ kj ln(l - p) 

i=o n 1 

_ „ /i-^ /£: fcz A; 2 ^/ki 2 + k 2 i + k 3 \\ , ^/fc? 

= El£j -ex P (j:-( + + +0 ( — ^ — )) +to+0 (^ 

i=0 

T-i j-i / u . -4/3 k 2 j + kj 2 (kj kj(k 2 +f) \\ 
= E k Ejexp(Xk3n 1 + W + ^ ))■ ( 46 ) 

In particular, for k,j < Cn 2 ! 3 , E^j = 0(E k Ej), and Q4.5|) implies, for fixed £, a 
and b, with < a < b < oo, recalling that EE £ n [a, b] = O(l) by 

E(Hi[a,6]) 2 =E(X n ([an 2 / 3 ,6n 2 / 3 ];^)) 2 

= 0((EI n ( [an 2 / 3 , on 2 / 3 ] ;l)) 2 +EI„( [an 2 / 3 , bn 2 ' 3 } ; *) ) 

= 0((EHi[a,6]) 2 + EHi[a,6]) = O(l). 



Thus, the random variables S^[a, 6] are uniformly integrable, and S^[o, 6] H^[o, 6] 
implies EH^[a,6] — > EH^[a,6], see e.g. Theorems 5.4.2 and 5.5.9]. Consequently, 
E E [a, b] equals the right hand side of (|4.4|) . This leads to the following result. Recall 
that \& denotes the moment generating function ()3.3j) of the Brownian excursion area. 

Theorem 4.1. The point process has intensity := (27r) _1 / 2, iU|X 3 ^/ 2— 5 / 2 e~ "^'^ 
on (0, oo). Their sum E /ias i/ie intensity, for < x < oo, 

oo 

A(x) = AW(s) := ^ A<(z) = (2vr)- 1 / 2 x- 5 / 2 ^(x 3 / 2 )e~ F ( a; ' A ). (4.7) 
^=o 

Proof. We have shown that ES'[a, 6] = Ai(x) dx when < a < b < oo, which 
by definition shows that A^(x) is the intensity of S . The second part follows by 
summing over I. □ 

Corollary 4.2. 

/OO 
(27T)- 1 / 2 x- 3 ^(x 3 / 2 )e- F ^ dx. (4.8) 

Proof. E Z £ = xA(x) dx. □ 

We already know that the expectation in (|4.8|) is finite; that the integral converges 
follows also by Q3.4|) and (|4.3|) . which imply that A(x) decreases exponentially as 
x — ► oo. Note further that the intensity A^(x) ~ (27r) _1 / 2 x~ 5 / 2 as x — > 0, for every 
A. 

Remark 4.3. The intensity A^ has a finite integral over (0, oo) precisely when the 
exponent 3£/2 — 5/2 > — 1. Thus, for £ = and £ = 1, E^ has an infinite expected 
number of points; indeed, it is easily shown from (|7.1[) and (|7.2|) that E £ a.s. has 
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an infinite number of points. On the other hand, for any i > 2, has a finite 
number of points. Further, ^£>2 " > the point process for the complex components, 
has a finite number of points. One may view this in an evolutionary way. Roughly 
speaking, when A is large negative complex components have not yet formed. When 
A is large positive a "dominant component" will have formed which is complex. 
But there will not usually be other complex components as components get "sucked 
into" the dominant component before becoming complex. In it is shown that 
with probability converging to 57r/18 ~ 0.87 there is never more than one complex 
component in the entire evolution of the random graph. 

Remark 4.4. Considering the difference E — E zf^ and letting e —> 0, we find 
from Theorem 11.11 and Corollary 14.21 the identity 

(27r)- 1 /2 x -3/2 xI , (x 3/2 ) ^-F( a; ,A)_ e -F(x,0)^ dx = J" x ^ (x)-A^ ( X )) dx = A. 

2 (4-9) 
Differentiating with respect to A we further find — X)A^ x \x) dx = 1 or 

x 2 (x - 2X)A {x) (x)dx = 2, 



and thus E £\ = 2 + 2A E £\ £ 



2 



Remark 4.5. We similarly find expressions for the expectations of sums of all 
points in H* with a given label. For example, for label 1, corresponding to unicyclic 
components, we obtain for the expectation of the total weight of H 1 

xAi(x)dx = \ e~ F{x ' x) dx. (4.10) 



Thus, cf. Remark 14.31 the H 1 process has an infinite number of points with finite 
sum. See also |18l Lemma 2.2], which implies both ()4.10|) and 



oo />oo 



:(AW(x) - A [ X) (x)) dx = (2TT)- 1 / 2 / x" 3 / 2 (l - e - F(x > x y) dx + A 



which indeed also easily follows from (|l.lj) and (|4.8j) . The expectation of the sum 
of all points with label at least 2 (corresponding to the total size of the complex 
components in G{n,p)) is / °° x[A^(x) — Aq X \x) — A^ (a;)) dx; an evaluation in 
terms of hypergeometric functions is given in [111 (15.12)]. 

5. An ESTIMATE FOR G(n,p) 

We prove in this section an estimate for the components of G(n,p) that we will 
need. This estimate is known, at least in principle, but we do not know any precise 
reference. 

Lemma 5.1. For any fixed e > and q > 0, 

E((X n [en 2 / 3 ,oo)) 9 ) =0(l), E ((Y n [en 2 l\ oo)) 9 ) = 0(n 2 ^ 3 ) . 

In other words, W n£ , Z ne and Z' n£ have moments that are bounded, uniformly in n. 

Proof. It suffices to prove the result for Y n , since Y n [en 2 / 3 , oo) > en 2 ^ 3 X n [en 2 ^ 3 , oo). 

Let us begin with the complex components; in this case we do not need a lower 
bound on the size of the components. (This is not surprising, since typically there 
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are no small complex components.) Let n c (G) denote the number of vertices in 
complex components of a graph G. Thus n c {G(n,py\ = do), > 2) . 

The excess of a graph, as defined in §13], equals the complexity minus the 
number of complex components. (Thus, a component of complexity i contributes 
max(l- 1, 0).) We first claim that there exists r] > such that if q > 1 and jnGi are 
fixed, and we momentarily consider the random graph G(n, m) with a fixed number 
of edges m = [_§ (1 + /in _1 / 3 )J , then 

E(n c (G(n,m)) 9 l[ excess (G(n, m )) = r ] ) = 0(n 2g / 3 (r + l)^"^) , (5.1) 

uniformly in n and r > 0. 

Indeed, the case q = of ()5.1|) is a special case of ^1 Lemma 5] (with d = 0), 
and the general case follows by a simple modification of the (not so simple) proof, 
as is remarked for the case q = 1 on pages 299-300] . 

We may thus sum (|5.1|) over r > 0, and find, still for fixed q and /i, 

E(n c (G(n,m)) q ) = 0(n 2q l ?J ). (5.2) 

Given A, choose ^ > A and observe that by a standard Chernoff estimate, see for 
example |12l Theorem 2.1], the probability that G(n,p) has more than m edges is 
0(e~ Snl 3 ) for some 5 > 0. Since n c is monotone if we add edges, any coupling of 
G(n,p) and G(n,m) thus gives, for fixed q > 0, 

E(Y n ([l,oo),>2) 9 ) =E(n c (G(n,p)) q ) < E(n c (G(n, m)) q ) + 0(^e~ 5 ™ 1/3 ) 

= 0(n 2 "/3). (5.3) 

For components of complexity or 1, i.e. trees and unicyclic components, it is 
possible to argue as for the second moment in Section |1J but we will instead use a 
trick together with the result just proved. 

Let P(n; k\, . . . ,kj\£) be the probability that j given disjoint subsets of the vertex 
set of G(n,p), with sizes k\, . . . ,kj respectively, all are the vertex sets of components 
with complexities t. Thus, with k = k\ + • • • + kj, 

j 

P(n; &!,...,%;£) = (i _ p )nfc-i 2 /2-3V2-j«+j JJ (5.4) 

i=l 

It is easily seen that for any integer q > 1, 

q 

E(Y n ([A,^)-e) q ) =J2 E c(n;q;j;k 1 ,...,k j )P(n;k 1 ,...,k j ;£), (5.5) 

j=l k\,...,kj>A 

for some combinatorial coefficients c(n; q; j; k\, . . . , kj) not depending on L For fixed 
I and £', we have by (jO|l and (|3~T|) . 



P(n;h,...,k f ,£) c^.^) Vi-p 



pi —P 



Hence, if f < £ and fcj > en 2//3 , we have P(n; ki, . . . , kj; £') = 0(P(n; ki,..., kj)l)) 
(recall that e is fixed), and (|5.5|) yields 



E 



(Y4[En 2 /^oc) ] £') q )=o(E(Y4len 2 /\ooy,e) q )) 
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We apply this with £' = and 1 and £ = 2, and obtain from (j5.3|) the required 
estimates for E(Y n ([en 2 / 3 , oo); 0) 9 ) and E(Y n ([en 2 / 3 , oo); l) 9 ), which together with 
(|5.3j) complete the proof. □ 

Before we proceed, we point out a simple consequence. Peres [2U] recently gave 
a simple proof (with an explicit bound) of the case q = 2; this case is equivalent 
to E|C(v)| = (^(n 1 / 3 ) where C(v) is the component containing a given (or random) 
vertex v. 

Corollary 5.2. Let q > 3/2. Then EX)i£ni = ^(-Oi equivalently, for G(n,p), 
E Ei = C(n 2 ^ 3 ) /or any 5 > 3/2. 

Proof. First, X^r|Cj|>n 2 / 3 — ^n["- 2//3 5 co) 9 , whose mean is 0(n 2q / 3 ) by Lemma l5~Tl 
Similarly, the sum over the complex components has expectation 0(n 2<?//3 ) by 
(|5.3|) . It thus remains only to consider components of size at most n 2 / 3 with com- 

2 /3 

plexity or 1. The corresponding sum has expectation Efc=i k q (tk + Uk), which is 
0(n 2 ^ 3 ) by Q and ljfi~7j) . □ 

Proof of Proposition \1.4\ It is an easy consequence of Lemma fA.2l that the mappings 
^ X^> e &j ^ £&>e& and fel* ^ #{^ : 6 > ^} are measurable on 

91(0,00] and continuous at every such that e ^ 

Since H n -i H and P(e G H) = by Theorem 14. H the results Z ne — > Z £ , -i Z e 
and W ne -i W £ follow by the continuous mapping theorem, see e.g. Theorem 

Since the moments of Z ne , Z' n£ and W ne are bounded uniformly in n by Lemma f5.ll 
this further implies convergence of all moments, see e.g. @ Theorems 5.4.2 and 
5.5.9]. □ 



6. Second partial proof of Theorem 11.11 

In this proof we do the calculations with the small components, and consider 
complexities and 1 separately. Let throughout < e < 1. 

Consider first the tree components. Let tk = EI n (fc;0) be the expected number 
of tree components of order k. By (|4.1|) . for k < n 2 / 3 , 

tk = rS^-e~ k exp(-F(kn- 2 /\ A)) (l - An" 1 / 3 + o(£) + 0(n" 2 / 3 )) . (6.1) 
In particular, with t* k := nk k ~ 2 e~ k /k\, 

t k = 0(tl) = 0(n k ^e- k )=0(-^). (6.2) 



Note further that, for any fixed real a and all e > 0, 

f«:=fo(^) = {°w»^)-o( 



n 2Q / 3 e Q - 3 / 2 ), a>3/2, 
a < 3/2. 
(6.3) 
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By (|6.1[) and (|6.3j) we obtain, 

en 2 / 3 en 2 / 3 rk—1 

EY([l,en 2 /%0) = £ kt k = n £ V'"^ 1 " 

k=l k=l 



+ °(Y1 kt* k (F(kn- 2 / 3 ,X) + kn- 1 + n 

k=l 

(„ _ An 2 / 3 ) ]T ^pe- fc + 0( £ t^n" 2 + JfeV 

k=l ' k=l 

en 2 / 3 , 

(„ _ An 2 / 3 ) ^ ^e" fc + o(e 1 / 2 n 2 / 3 ). (6.4) 



fc=i 

Moreover, using the fact that k fc! e~ fc = 1, and Stirling's formula, 
L> k\ ^ k\ 

k=l k>en 2 / 3 



= 1- (2vrfc 3 )- 1 / 2 + 0( k ~ 5/2 

k>en 2 / 3 k>en 2 / 3 
poo 

= 1 - / (2TTX 3 )- 1 / 2 dx + 0{e- 3 l 2 n- 1 ) 

Jen 2 / 3 

= 1 - yi e -V2 n -l/3 + ( £ -3/2 n -l). (6 . 5) 

Consequently, combining (|6.4j) and 1)6. 5 Jl . 

EY n ([l,en 2 / 3 ];0) 

^I e -V2 n 2/3 _ An 2/3 + Q ^l/2 n 2/3 + £ -3/2 + ^1/2^1/3^) ( g g) 



Next, let Ufc = EX n (fc; 1) be the expected number of unicyclic components of 
order k. We have, cf. (|17T|) and (|XT|) . 

^=Qc(M)/(i-p) M)fc+ ^ 

C(fc,l) n 



= 0(n- 1 fc 3 / 2 t fc j, (6.7) 

and thus, by ()6.3j) . 



E7n([Mn 2/3 ];l) = ^ feu* = Ofn" 1 ^ fc 5 / 2 t fe ) = 0(n 2 / 3 e). (6.8) 



k=l k=l 



For complex components we use the well-known fact that EX n ([l, oo); > 2) is 
bounded; see the stronger result in [7j, Theorem 5.8(i)]. (As a bound we can 
take 1.2, say, at least for large n, and possibly 1, as conjectured in ^|.) Hence, 

EY n ([l,en 2 / 3 ];>2) < en 2 / 3 EX n ([l, en 2 / 3 ]; > 2) = 0{n 2 ' 3 e). (6.9) 
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Adding (|6.6j) . (|6.8|) and (|6,9|) . we find, since the sum of all component sizes 
r„([l,oo))=n, 

EZ; £ = n" 2/3 EK„(en 2/3 ,oo) = n~ 2/3 (n - EY n ([l, en 2/3 })) 

= ^1 e -l/2 + A + O ^1/2 + £ -3/2 n -2/3 + £ -l/2 n -l/3^ 

Thus, letting n — ► oo, by Proposition 11,41 

which is (jl.lj) with the weaker error term 0(e l l 2 ). 

Next, consider the variance of y n ([l,en 2 / 3 ];0). Similarly to g3J) we have, with 
t = 0, and Ek = ifc, 

(en 2 / 3 v 2 en 2 / 3 en 2 / 3 en 2 / 3 

]T kX n (k;£)) = E E k 3 E K 3 + E k2Ek - ( 6 - 10 ) 
fc=l ' fc=l j'=l k=l 

Hence, using gSJ) and letting A 2 := Efc=i /3 fc2 *fc = C^n 4 /^ 1 / 2 ), by (IQl) . 

en 2 / 3 en 2 / 3 era 2 / 3 

Var(y n ([l,en 2 / 3 ];0)) = £ k\ + ]T kj(E kd -t k tj) 

k=l k=l j=l 

en 2 ! 13 en 2 ! 13 en 2 / 3 

= E ^+ E E WjO^jn- 4 / 3 ) 
fc=i fc=i j=i 

en 2 / 3 era 2 / 3 era 2 / 3 

= E^ + °(E E^' 2 ^~ 4/3 ) 

k=l k=l j=l 

= A 2 + 0(A 2 2 n~ i/3 ) =A 2 + 0(n 4 / 3 e) . (6.11) 

In particular, by (|6.3|) . this variance is 0(n 4/3 e 1/2 ). 

The variance of Y n ([1, en 2 / 3 ] ; l) can be computed in the same way, with replaced 
by Ufc. Since = 0(e 3 ^ 2 tk) for k < en 2 / 3 by ()6.7|) . we obtain the estimate 

Var(Y n ([l,en 2 / 3 ];l)) = 0{e 3 l 2 A 2 + e 3 A 2 n^l 3 ) = 0{n 4 ' 3 e 2 ). (6.12) 

For the complex components we now use the fact that also E(X n ([l, oo); > 2) 2 ) 
is bounded [7j, and thus 

Var(Y n ([l,en 2 / 3 ];>2)) < E(Y n ([l, en 2 / 3 ]; > 2) 2 ) < e 2 n 4 / 3 E(X n ([l,en 2 / 3 ]; >2) 2 ) 

= 0(n 4 / 3 e 2 ). (6-13) 

By the Cauchy-Schwarz inequality, the three covariances between the three vari- 
ables in (IfHTT) . (l6~T2l and (HOSl) are all 0(n 4 / 3 e 5 / 4 ), so summing the variables we 
find from these formulas that 

Var(Y n (en 2/3 ,oo)) = Var(Y n [l, en 2/3 ]) = A 2 + 0(n 4/3 e). (6.14) 
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Moreover, by 1)6.1)1 . 1)6.3 J) and Stirling's formula, 

en 2/3 en 2 / 3 

A 2 = k\ = £ fc 2 ^(l + 0(fcn~ 2 / 3 + n-VB)) 
fc=i fc=i 

en 2 / 3 

= ^ n((2vrA;)- 1 / 2 + O^" 3 / 2 )) + 0(n 4 / 3 e 3 / 2 + ne 1 / 2 ) 
fc=i 

= (2/7r) 1 / 2 e 1 /V+ 1 / 3 + 0(n + n 4 / 3 e 3 / 2 ) . 
Thus, yields 

Var Z; £ = n" 4 / 3 Var (y„ (en 2 / 3 , oo)) = (2/n) 1 / 2 e 1 / 2 + 0(e + n" 1 / 3 ). 
and 1)1.2)1 follows by Proposition 11.41 

7. Proof of Theorem 11.21 
By Theorem l4~T1 and 1)4.2)1 . for < e < 1 and fixed A, 

/OO Z 1 ! 
A(x) cix = EV^i + y (2vr)- 1 / 2 x- 5/2 ^(x 3 / 2 )e- F(a ' A) dx 

= 0(1) + ^ 1 (2^)- 1 / 2 x- 5/2 (l + u»ix 3/2 + 0(x 3 )) (1 - F(x, A) + 0(x 2 )) dx 

= ^ 1 (2tt)- 1 / 2 x- 5 / 2 (1 + wix 3/2 - xA 2 /2 + 0(x 2 )) dx + 0(1), 

and Cd follows. (Recall that w x = (tt/8) 1 / 2 .) 

For the variance, we use 1)4.5)1 with a = e and 6=1 and argue as in Section H3 
Calculations similar to ()6.11)1 yield 

Var(X n ([en 2 / 3 ,n 2 / 3 ];0)) =E(X n ([en 2 / 3 ,n 2 / 3 ];0)) +0(e- x ) = 0(e^l 2 ) (7.1) 
and 

Var(X n ([en 2 / 3 ,n 2 / 3 ];l)) = E(X n ([en 2 / 3 , n 2 / 3 ]; l)) +0(1) = 0(ln(l/e)); (7.2) 

we omit the details. Using again the fact that E(X n ([l, oo]; >2) 2 ) = O(l) together 

with E((X n (n 2 / 3 ,oo]) ) = 0(1) (L emma l5~T)l and the Cauchy-Schwarz inequality, 
we obtain 

Var(X n [en 2/3 ,oo]) = E(X n [en 2/3 , oo]) + 0(e _1 ). 

Letting n —* oo, we find using Proposition PI Var W e =EW £ + O^ 1 ), and (PJ 
follows. 

8. The Palm distribution 

The Palm distributions of a point process S in a suitable space 6 are the con- 
ditional distributions £(H | s S H) given the presence of a given point s, s £ &. 
(Usually, s G 6 is an event of probability 0, so this must be interpreted with some 
care, see Chapter 10]. In particular, note that the Palm distribution is uniquely 
determined only for a.e. s.) 

In our case, the Palm distribution is obtained by a simple shift of the parameter A; 
we thus write = }i in this section. Recall that we regard as a random 
measure on (0, oo) that is the sum of the pointmasses 6 (a), see Appendix 1X1 
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Theorem 8.1. The Palm distribution £(3^ \ s £ H^- 1 ) equals for every s > the 
distribution o/H( A ~ s ) + 5 S . 

Proof. Given that G(n,p) has a component of size m on a certain set of vertices, the 
remainder of the graph is distributed as G(n — m,p). Hence, if *Tt = 91(0, oo] is the 
space of locally fknite integer- valued measures on © = (0, oo] defined in AppendixEI 
and / : (0, oo] — ► R and g : 91 — > R are bounded continuous functions and / has 
compact support, then 

E( 5 (S n ) I™ fd~ n ) =E(s(3n) £/(£*)) =E(j2K(tini)), (8-1) 

"'° i i 

where /i n (s) = /(s) Eg(E n _ sn 2 /3jP + 5 s ). If n — > oo and s n — ► s, then (n — s n n 2 / 3 )p = 

1 + (A - 8 + o(l))n- 1 / 3 and S n _ ann 2/3 iP H( a ~ s ), and thus h n {s n ) -» /i(s) : = 
/(s)E 5 (H( A " s ) + <5 S ). It follows, using Lemma IA.2I and [HI Theorem 5.5], that 
J2i h-n((,ni) —> Y^,i an d thus by (|8.1j) and dominated convergence, 

E 



p OO OO 

9(H (A) ) / /(EM) = lim E( 5 (H n ) / fdE n ) = lim E(YX(£n 

/* OO /* oo 

= e(J>(&)) = / h(s)dME^(s) = J f( S )Eg(E^ + S s )dE^ x \ S ), 

(8.2) 

where <iEE( A )(s) = A^ x \s)ds by Theorem 14.11 It follows by a monotone class 
argument (e.g. jSJ Theorem A.l]) that the first and last terms are equal for any 
bounded measurable g, and the result follows, see ^3 (10.2)]. □ 

Note that Theorems 18.11 and 11.11 imply that for any fixed A and s > 0, for small 
e (so that e < s), E(zJ A) | s 6 H w ) = Ezi A ~ s) + s = EZ^ X) + O^ 1 / 2 ). Hence the 
existence of a certain point in E asymptotically does not influence EZ £ for small e, 
showing the rigidity of E. 

Theorem 18.11 can be put in a computational form as follows. Let, as above, 
91 = 91(0, oo] be the space of integer- valued measures defined in Appendix lAl 

Theorem 8.2. For any bounded or non-negative measurable function F : (0, oo) x 

91 -> [o,oo], 

poo 

EVF(C1 A) ,H( A )) = / EF^H^ + 5 X )A ( - X \x)dx, (8.3) 
Jo 

where A^ x \x) is given by (|4.7jl . 

Proof. First consider F of the special form F(x,S) = f(x)g(5), where, as in the 
proof of Theorem 18. 1) / : (0, oo] — > R and g : 91 — > R are bounded continuous 
functions and / has compact support. Then 1)8.2)) holds, which can be written 

poo poo 

E/ F(x,E^)dE^(x)= EF(s,H( A ^ + S x )A^ x \x)dx. (8.4) 
Jo Jo 

By another monotone class argument (e.g. [HI Theorem A.l]), 1)8.4 J) holds for every 

bounded measurable F, and thus by monotone convergence for every non-negative 

measurable F too. 

The integral on the left hand side of 1)8.4)1 equals F(£^\ E( A )), which yields 

(EH). ' ' □ 
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We give some applications. 
Corollary 8.3. Let A (A) (x) be given by f|4.7|) . Then, for every e > ? 

EZ £ 2 = / x 2 AW(x)dx+ / / xy^ x \x)A. {x - x \y)dydx 



and thus 

VarZ £ = I x 2 A^ x \x)dx- J J xyA^ (x)(A^ (y) - A^~ x \y)) dy dx 
xyA^ (x) (A^ (y) - A^ x ~^ (y)) dy dx. 

Proof. Take F(x,E) = x J^°ydE(y)l[x > e] in l|8.3|) . or f(x) = xl[x > e] and 
^(S)=/ 6 00 ydS(y)ine2>,tofind 



E 



xE(J ydE^ x - x \y) + a?) A< A > (a?) da;, 



which yields the formula for E Z 2 by Theorem 14.11 (or Corollary I4.2JI applied with 
A — x. 

The first formula for Var Z e follows immediately, and the second follows because 
(|4.9|) implies 



poo 

/ y{A^{y)-A^\ y ))dy 
Jo 



oo roo poo 



X 

and thus 

/*OC 

x 2 A^(x)dx = I I xy(A^ x \y)-A ( - x - x \y))A^(x)dydx. 

Je JO □ 
Corollary 8.4. E^ is a.s. simple, i.e. lacks multiple points. 

Proof. Take F(x,E) := > 2] in (|8.3|) . The left hand side becomes the ex- 

pected number of multiple points (with multiplicities), while the right hand side is 
because, for each x, EF(x, 3( A_X ) +8 X ) = F(E^ x ~'^{x} > l) = 0, using Theorem IP 
which shows that the intensity of ~( A_:r ) is absolutely continuous. □ 

Corollary 8.5. The largest point £^ in E^ has a distribution with the density 
function h[ X) (x) ■.= F(E^ x ~ x \x,oo) = O)A^(x). 

Proof. Let / : (0, oo) — > [0, oo] be a measurable function and take F(x, H) : = 
f(x)l[E(x, oo) = 0] in Theorem 18.21 Since E^ is simple by Corollary 18.41 the left 

hand side of QH.MJI becomes E/(£^), and the right hand side is J °° f(x)h[ X \x) dx. 
Since / is arbitrary, the result follows. □ 

The proof immediately extends to the following, more general, result. 

Corollary 8.6. For any k > 1, the k:th largest point Q in E^ has a distribution 
with the density function h^(x) := ¥(E^ x ~ x \x, oo) = k — l)A( A )(x). □ 

Corollary 8.7. For any Borel set B C (0, oo) and k > 1, 

E(S( A '(B)-) = /••• / A^{x 1 )A ( - x - x ^(x 2 )---A^- Xl —- Xk - 1 \x k )dx k ---dx 1 . 
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Proof. For k = 1, this is just the definition of intensity, see Theorem 14.11 For k > 2, 
we use Theorem IO with F(x,S) := l[x G S](5(S) - l)— , which yields 

/•oo 

E(5(A)(B)^)= / 1[^B]E(S^(B))- A^(x)dx, 
Jo 

and the result follows by induction. □ 

Remark 8.8. It follows immediately that if B C [a, 6] with < a < b < oo, then 
E(H( A )(i?)-) = 0(C k ) as fc — > oo, for some C < oo depending on B and A; with 
only a little more effort, the same can be shown also for B C [a, oo]. This implies 

Ee t3W(B) < ^ for 

every such i? and t < oo. In particular, the distribution of 
H^(i?) is determined by its (factorial) moments. Hence the formula in Corollarv l8.7l 
in principle determines the distribution of (B) for any relatively compact B C 
(0, oo]. Moreover, the formula in Corollary 18.71 easily extends to mixed factorial 
moments of S (A) (Si), . . . , H (A) (5 m ) when Bi,...,B m are disjoint relatively compact 
Borel sets. This extension, which we leave to the reader, characterizes the joint 
distribution of 3^ x \Bi), . . . (B m ), and thus ^3 Theorem 3.1] the distribution 
ofoW. 

Remark 8.9. If B C [a, oo] for some a > 0, we have, using the estimate in Re- 
mark the standard formula 

00 (—-\) k 

k=0 

which together with Corollary 18.71 (and perhaps the Bonferroni inequalities) can be 
used for numerical evaluation of P(3^(B) = 0), and thus, in particular, of the 
density function in Corollary 18.51 

Remark 8.10. It follows easily from Theorem 13 . 1 1 that a result analogous to The- 
orem |H3 holds for H* too. 

9. Limits as A — > ±oo 

In this section we consider limit results for H^, and in particular for the largest 
point as A — ► ±oo. These results are equivalent to limit results for G(n,p) with 
p = n~ l + A(n)n~ 4 / 3 with A(n) — ► ±oo slowly, but we get in this way no information 
on the allowed range of A(n). 

Consider first A — ► — oo. By l|4.2|) . F(x,X) — > oo for every x > and F(x,X) 
is monotone in A for A < 0. Recalling the notation W £ := H( A )[e,oo), it follows 
by dominated convergence that, for every fixed e > 0, E W £ = A^{x) dx -► 0. 
Hence, F(W e > 0) -> and P(S[e, oo) = 0) -> 1. Consequently, S (in the vague 

topology, see Appendix lA"|) and £^ 0. 

We can by much more precise. For |A| > 1, let 

a x := 31n|A| - f lnln|A| - \ ln(2 4 3 5 vr), (9.1) 
so that, as A — ► ±oo, a\ ~ 3 In | A | and 

e -«* = |A|- 3 (ln |A|) 5 / 2 (2 4 3 5 vr) 1/2 „ 4 vr 1 /2| X\^a 5 x /2 . (9.2) 
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Theorem 9.1. As A — ► — oo, 

l A l%( A ) „ At/ 

where V has the Gumbel (extreme value) distribution ¥(V < s) = e~ e 

Proof. Fix a real s, and let N^(x) := H( A )(x,oo), the number of points in SW 
larger than x. Thus EiV (A) (x) = EH (A) (x, oo) = A (A) (y) dy. With the change of 
variables y = 2A _2 (a A + 0) we obtain 

poo 

EN^(2\- 2 {a x + s)) = 2X- 2 A^(2\- 2 (a x + t)) dt. (9.3) 

J s 

For A < and any real t we have, by (|4.2|1 . 

F(2A- 2 (a A + t), A) = |A- 6 (a A + t) 3 + 2|A|- 3 (a A + t) 2 + (a x + t) = a x + t + o(l), 

as A — > — oo with i fixed. Since — ► 1 as x — > 0, it follows from this, (|4.7|) and 
~H) that 



2A- 2 AW(2A- 2 (a A + t)) 

= (2^)- 1 / 2 2" 3 / 2 |A| 3 (a A + t)- 5 / 2 ^(2 3 / 2 |A|- 3 (a A + t )3/2) e -F(2A-»(a A +t),A) 



ivr-^IAlV^e-^-'+^^e-*. 



Moreover, for i > s and A < with |A| so large that a A > 2|s| we also obtain, using 
= 0(e x2 / 6 ) from (jS3J) and A) > x 3 /6 + xA 2 /2 from (OJ), 

2A- 2 AW(2A- 2 (a A + t)) =o(|A| 3 a- 5/2 e-^ +t )) =0(e" f ). 

Consequently, we can use dominated convergence in (|9.3|) and thus 

EAT( A )(2A- 2 (a A + s)) - 



oo 

t 



Higher factorial moments can be computed similarly using Corollary 18.71 with 
B = (2A _2 (a A + s),oo) and Xj = 2A _2 (a A + tj). Note that, for fixed tj, Xj — > 
0, and thus F(xj,X — x\ — ■ ■ ■ — = F(xj,X) + o(l). Note further that, for 

A < and every u,x > 0, u \x) < A^ A -*(x); hence the bound used to verify 
dominated convergence above applies to each factor in this multivariate setting too. 
Consequently, for every k > 1, 

/OO /'OO 
... / e -tx . . . e -t k dtk . . . dh = ( e - S )fc 

By the method of moments, this implies (2A~ 2 (aA + s)) — > Po(e _s ), and thus 



P(^1 A) < 2A- 2 (a A + s)) = ¥(N^(2\- 2 (a x + a)) = 0) - e~ e ~ s . 

Remark 9.2. The proof yields also the asymptotic distribution of £g > £3^; 
In fact, for every fixed i, as A — ► — oo, 

i— 1 _ ■ 

Pfe (A) < 2A- 2 (a A + ,)) = P(iV( A ) (2A- 2 (a A + .)) < i) -> £ ^V e ^; 

i=o ■ 7 ' 



□ 
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if we write the right hand side as P(Vj < s), this can be written 

Note that these asymptotic distributions are the same as for the i:th records of 
suitable i.i.d. sequences, see Section 2.2]. 

More generally, the proof above is easily extended to show that 3^ with the 
points rescaled as above, converges in distribution to a Poisson process on (— oo, oo) 
with intensity e~ s . (This holds in 9T[— a, oo] for every a, say; we cannot use oo, oo] 
directly, since the rescaled processes are not elements of this space.) Thus for 
A — > — oo, the point process becomes Poisson-like. 

In particular, £ 4 - A ^ is roughly 2\~ 2 a\ ~ 6 In |A|/A 2 for every fixed % > 1. This can 
be made precise in the following form, where we use the notation that X\ ~ p x\ if 
Xx/x x ^ 1. 

Corollary 9.3. As A -► -oo, ^{ A) ~ p 61n|A|/A 2 for every fixed i > 1. □ 

Remark 9.4. The asymptotic results for large negative A in Theorem 19.11 and 
Remark E21 have a natural interpretation. The results on the asymptotic distribution 
of are what they would be if were replaced by a Poisson point process with 
intensity A^) . This corresponds to the view that as one moves in the critical window 
toward the subcritical phase the largest components become "local phenomenon" 
and their interaction becomes negligible. 

Let us now turn to A — > +oo. It is well-known that in this case, with probability 
tending to 1, rS^ contains exactly one large point. In fact, £^ -h> oo and £^ 
as A — > +oo. Again, we can be much more precise. 

Let X and Y by two random variables. The total variation distance between the 
distributions of X and Y is defined as 

d TV (X,Y) :=sup|P(A: G B)-F(Y G B)\, 

B 

taking the supremum over all Borel sets B. Note that this only depends on the 
distributions C(X) and C(Y), although we for simplicity use the notation d^viX, Y) 
instead of dTy(C(X),C(Y)); we will also write dxv(A^, //) when Y has distribution 
[i. Note also that c?tv is a very strong measure of distance between distributions; 
for example, for a sequence X n , d^y(X n ,Y) — » is much stronger than X n —> Y, 
and thus (i) below is stronger than asymptotic normality in the standard form 

(^ A) -2A)/727A^iV(0,l). 
Theorem 9.5. If A — > +oo, then 

(i) d TV (eS A) ,A r (2A,2A- 1 )) ^0; 

(ii) - a x -i V, with a\ as in (|9.1I) and V as in Theorem \9.1[ 

The proof below also shows that ^|A| 2 £ l - A ' 1 — a\ —> Vi-\ for every i > 2, with Vi as 
in Remark 19.21 

Corollary 9.6. As A -> +oo, ^ A) ~ p 2A and ~ p 6 In A/A 2 for every fixed 
i>2. ' □ 

To prove Theorem 19.51 we begin with two lemmas. Let tp\ denote the density 
function of N(2A, 2A" 1 ); thus, <p x (x) = (A/4^) 1 /2 e -A(x~2A) 2 /4_ 
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Lemma 9.7. As A — » +00, 

z>oo 

|A< A )(x) - <px(x)\ dx -► 0. 



The lower limit A is for convenience only; it can easily be replaced by, e.g., 1. 

Proof. For x > A, {QJ yields f(x 3 / 2 ) = ixV 3 / 24 (l + o(l)), with o(l) ^ as 
A — ► +00, uniformly in x > A. Using \a 1 / 2 — b 1 l 2 \ = \a — 6|/(a 1 / 2 + b 1 / 2 ) and 
\e a - e b \ <\a- &| e max { a > fe }, we thus find from g2) and (g3J), for x > A, 

AW(x) = (2vr)- 1 / 2 ix 1 / 2 e- a; ( a; - 2A ) 2 / 8 (l + o(l)) 

= (8vr)- 1 /2 (2A) i/2 e -x.(x-2A)V8 (l + o(1) ) + (\x- 2X\\- 1 / 2 e- x ^- 2X ^^ 

= <p x (x)(l + o(l)) + o({\ 1 ' 2 \x - 2A| 3 + A -1 / 2 |x - 2A|)e- A ^- 2A ) 2 / 8 ' 



The result follows by integrating; the O term yields, if we let Z ~ A r (2A,4A 1 ), 
0(E(X 1 / 2 \Z - 2A| 3 + \~ X I 2 \Z - 2A|)) = 0{X^). □ 

Lemma 9.8. For any random variables X and Y with density functions fx and 
fy, and any Borel set BC1, 

d TV (X,Y)< [ \fx(x) — /y(x)| dx + P(Y ^ B). 
Jb 

Proof. It is well-known, and easy to verify, that 

/oo 
\fx(x)- f Y (x)\dx. 
-oo 

Since f fx = 1 = / fy, we have 

/ l/x(x)-/y(x)|(ix< / (fx(x) + fy(x)) dx 
J B c J B c 

= 2F(Y $ B) + I (f x (x) - fy{x)) dx = 2P(Y $ B) - I (f x (x) - fy{x)) dx 
Jb c Jb 

and thus J K \f x (x) - fy(x)\ dx < 2¥(Y $ B) + 2 f B \f x (x) - fy(x)\ dx. □ 
Proof of Theorem\£3 If x > A > 0, then A( A - X )(2/) < A (0) (y) for y > and thus 

poo 

P(~( A ^')(A,oo) > 1) < EH( A - a ')(A,oo) = J k^-' x \y)dy 
< A (0) (y)dy = EH(°)(A,oo) ^0 



as A — > 00. Hence, by Corollary 18.51 £^ has a density function /i^ with h[ X \x) 
(l — o(l)) A^ A )(x) as A — > 00, uniformly in x > A. Since Lemma [9 . 71 implies A^ A ' 
0(1), this yields 



00 

h[ X) (x) - A W (x)|dx<EH(°)(A,oo) / A W (x) dx -> 



Hence Lemma 19.71 yields J^°\h\ X \x) — ip\(x) \ dx — ► as A — ► 00, and (i) follows by 
Lemma 19. 81 with B = (A, 00). 

For (ii), we observe that, by a simple extension of the proof of Corollary 18.51 
the conditional distribution C(E^ \ ^[ X ^ = x) equals the conditional distribution 
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C(5 X + E^ x x ) I E( x x \x,oo) = 0). Since the second largest point in 5 X + H^ A x \ 
when E <yX ~ x \x, oo) = 0, is the largest point x ^ in E^ x ~ x \ we have, in particular, 

£(^ A ) | ^ = x )= C(C[ X ' X) | C[ X ~ X) < x) . (9.4) 

Let A -> +oo, and assume A > 2. By (i), P(|^ A) - 2A| < 1) -»• 1. If \x - 2A| < 1 
and A' := A — x, then |A'— (— A)| < 1 and thus, by (|9.1j) . \a x — ay\ = 0(1/ X). Hence, 
it follows from Theorem 19. II that 

iA 2 ^ - o A = |A/A'| 2 (I|Afe! A,) - ay) + (|A/A'| 2 - l)a v + a* - a x 4 V. (9.5) 

Furthermore, IP(^ 1 A ^ < x) — ► 1, again by Theorem l9.11 and thus ()9.5[) holds also for 

the conditional distribution given ^~ x) < x. By flOJ) and P(|^ A) - 2A| < 1) 1, 
this yields, for every y, 

P(±A 2 ^ A) - a x < y) = EP(±A 2 ^ A) - a x < y \ - F(V < y), 
which proves (ii). □ 

Remark 9.9. Note that it is a fallacy to believe that Theorem 18.11 implies that 
conditioned on £^ = x has the distribution of 5 X + H^ -3 ^; as is seen in 
the proof above, the correct conclusion requires conditioning on E^ x ~ x \x, oo) = 0. 
Nevertheless, the proof also shows that the erroneous statement is asymptotically 
correct as A — > +oo: If £ has the distribution of £^ given in Corollary 18.51 or 
simply £ ~ N(2X, 2A" 1 ), and given £ we take a random E^ A then the distribution 
of + ~( A ~£) approximates that of 5^, and d T y(5 i + E^ x ^, =W) -> as A -> +oo. 

Remark 9.10. As remarked above, Theorem 19.51 implies asymptotic normality of 
the size of the largest component in G(n,p) in the case p = n^ 1 + A(n)n _4//3 with 
A(n) — > oo slowly (without specifying the allowed rate). Indeed, asymptotic nor- 
mality has been shown for all A(n) in the range A(n) — > oo but A(n) = 0(ra 1//3 ), 
i.e. p = 0(n _1 ), by Pittel H2] (p = c/n) and Pittel and Wormald [23] (the general 
case). 

Appendix A. Appendix: Point processes 

We give here some technical remarks on point processes; see e.g. ^3] an d 
Section 4] for further details and proofs. 

Let © be a 'nice' topological space (more precisely, a locally compact Polish 
space); in this paper we only consider the intervals (0, oo) and (0, oo] and their 
products with N or N*. Although we regard a point process as a random (multi)set 
C ©, it is technically convenient to formally define it as a random measure 
Y^i Hence, if E denotes the point process {£i}, we write E(^4) for the number of 
points £j that belong to a subset A C S; similarly, for suitable functions / on 6, 

Thus, let 91 = 91(6) be the class of all Borel measures fi on 6 such that fi(A) is a 
(finite) integer 0, 1, . . . for every relatively compact Borel set A; this coincides with 
the class of all finite or countably infinite sums of the type Yi > where i, S 6 
and each compact subset of & contains only a finite number of Xi, and we identify 
such a sum with the (multi)set {xi}. 

The standard topology on 9T (known as the vague topology) is defined such that, 
for fi,fj,i,fi 2 , ■ ■ ■ € % n n -> fj, if and only if / f d[i n -> / / dp for every / G C c (6), 
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the space of (real-valued) continuous functions on (5 with compact support. (This 
is a metrizable topology and 9T is a Polish space, see Section 15.7].) 

A point process on S is a random element of 91. If 5 is a point process on 6, 
there exists a unique Borel measure v on & such that E3(vl) = v{A) for every Borel 
set A, and more generally E J hdE, = J hdv for every positive measurable function 
h. This measure v is called the intensity of 5. In the cases we consider, & is an 
interval or a union of intervals, and v is absolutely continuous; then also the function 
dv/dx is called the intensity. 

If 3 n and H are point processes on 6, then 3 n -i S (w.r.t. the vague topology 

just defined) if and only if f f cG n —> f f dr. (as real-valued random variables) for 

every / £ C c (@). It is also true that 3 n — ► 3 if and only if r. n {A) —> r(A) for every 
relatively compact Borel set A C. & such that z.(dA) = a.s., and moreover joint 
convergence holds for every finite collection of such sets A. 

We state a particular case that we need. Say that a point x is a continuity point of 
a point process 3 if x is a continuity point of EH, i.e. if E3{x} = 0, or equivalently, 
x $l 3 a.s. 

Lemma A.l. ijf S n — ► 3 as pomi processes on an interval J, then 3 n [a, 6] — > H[a, 6] 
/or euery interval [a,b] C J smc/i i/tat a and 6 are continuity points o/S. □ 

Note that the definitions of both point processes and convergence of them are 
sensitive to the choice of &, since a point process is not allowed to have any cluster 
point in &. Hence, it matters whether boundary points are included in 6, even 
if they are not attained by any point. For example, if & is the closed interval 
[0, oo] (or any compact set), then every point process is finite. If, instead, & is the 
half-open interval (0, oo], then an element £ 91 is finite on every interval [a, oo], 
and thus every point process may be written as a (finite or infinite) set with 
oo > £i > £2 > • • • and, if the set is infinite, £j — > as i — > 00. Similarly, a point 
process on the open interval (0, 00) may have both and 00 as cluster points. By 
including one or both endpoints, we thus get stronger conditions, and, similarly, we 
get a stronger mode of convergence. It may thus be advantageous to consider (when 
possible) a set of points in (0, 00) as a point process on [0, 00), (0, 00] or [0, 00]. 

For point processes on a closed or half-open interval, with the points ordered as 
above, convergence is equivalent to joint convergence of the individual points. We 
state this for the case we are interested in. 

Lemma A. 2. There is a bisection between 91(0,00] and the space of sequences (£i)x° 
with £1 > £2 > • • • > and lim^oo = 0, such that E = {£i}f =l € (or, more 
formally, 3 = Y^,i=i <%^> with £1 > £2 > • • • an d < N < 00, corresponds to the 
sequence (£i)f° where we define £j := fori > N. This bijection is a homeomorphism 
between 9t with the vague topology and the space of sequences with component- wise 
convergence (i.e., the restriction of the product topology on [0, 00] 00 J. 

Consequently, j/H„, 1 < n < 00, are point processes on the interval (0, 00], and 
we write 3 n = {£ni}j=i with £ n i > £ n 2 > • • • and < N n < 00, and if some 
N n < 00, we further define £ n j = for i > N n , then 3 n —> 3^ if and only if 

(£ni> £n2j ■ ■ ■ ) — * (£ooi> £oo2> • • • )> i n the standard sense that all finite dimensional 
distributions converge. □ 
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